A Model of USENET Newsgroups Dynamics: Implementation and Results
نویسندگان
چکیده
We present an implementation of the multilevel text processing model of discussions in USENET groups proposed earlier (NLDB '02 Proceedings). In the statistical processing phase, a discussion thread is SGML tagged to include the relevant information about parent-child relationships among the postings as well as other meta data of postings and threads. This tagged output is then processed by a generic information retrieval system. Various relevant metrics that measure properties of discussions (such as thread focus, relevance of posting, discussion density, etc.) are defined and computed. The subsequent semantic component (utilizing tools like electronic lexicons, POS taggers and parsers) has been implemented to work in a modular fashion to allow inclusion or exclusion of some of its subcomponents. The user may tune this module to its minimal level to process semantics of individual words only, or up to its maximal level to include words with their full contexts. We also present evaluation data assessing the performance of the system with or without some of its modules.
منابع مشابه
HUMAN COMMUNICATIONS ISSUES Topic Development in USENET Newsgroups
groups were developed for topics beyond the purely and directly scientific to allow discussion of more general topics. The familiar naming scheme developed, with newsgroups—at least nominally—devoted to scientific topics being preceded with ‘‘sci’’ (e.g., sci.physics) , recreational topics with the prefix ‘‘rec’’ (e.g., rec.motorcycles) , wordy discussions with ‘‘talk’’ (e.g., talk.origins) , a...
متن کاملUsenet newsgroups' profile analysis, utilising standard and non-standard statistical methods
متن کامل
An Empirical Exploration of Mass Interaction System Dynamics: Individual Information Overload and Usenet Discourse
The large-scale adoption of computer mediated communication technologies has resulted in what has been described as “mass interaction”, shared discourse between hundreds, thousands or more individuals. A number of theoretical papers have made the argument that because of the existence of various technological and psychological constraints, the forms that mass interaction takes, can, partly be u...
متن کاملSocial Roles in Electronic Communities
Individuals’ behavior in groups is constrained by several factors, including the skills, privileges and responsibilities they enjoy. We call these factors a social role, and explore using the concept of social roles as an analytical tool for studying communities in Usenet newsgroups. Our understanding of what roles are and how they function is derived from sociolinguistics, social psychology, a...
متن کاملMass Interaction, Information Overload and Computer Mediated Communication Tools
The large-scale adoption of computer mediated communication technologies has resulted in what has been described as “mass interaction”, shared discourse between hundreds, thousands or more individuals. The emergence of mass interaction presents new opportunities to learn about and understand human communication, and information technologies. A number of theoretical papers suggest that the forms...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003